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The permanent of an m x n matrix A = {aij), with m < n, is defined as 



where the summation is over all injections a from M = {1,2, ... ,m} to 
N = {1,2, . . . ,n}. While studies on permanents - since their introduction 
in 1812 by Binet [3] and Cauchy [5j - have focused on matrices over fields 
and commutative rings, we generally only assume the entries are from some 
semiring, that is, multiplication need not commute and additive inverses need 



In this note, we give simple algorithms to evaluate the permanent of 
a given matrix. In arbitrary semirings, we apply Bellman-Held-Karp type 
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dynamic programming [11121 [7] across column subsets; in commutative semir- 
ings, a "transposed" variant is shown to be considerably faster. In arbitrary 
rings, the starting point is Ryser's classic algorithm |T3] that we manage to 
expedite for rectangular matrices, but that remains the fastest known algo- 
rithm for square matrices; again, in commutative rings, a transposed variant 
is shown to be substantially faster for rectangular matrices. 

To state our main results, we take the time requirement of an algorithm 
as the number of additions and multiplications it performs, while the space 
requirement is taken as the maximum number of semiring elements that it 
needs to keep simultaneously in memory at any point in the computation. 
Also, denote by (j^^) the sum of the binomial coefficients (q) + (i) + ' ' ' + if) ■ 

Theorem 1. The permanent of any m x n matrix, m <n, can he computed 

(i) in semirings in time 0^m(j^^)j and space 0^(j^J^)j; 

(ii) in commutative semirings in time 0{m{n — m + 1)2™") and 
space 0((n — m + 1)2™"); 

and 



(Hi) in rings in time Oyn[^^^^j and space 
(iv) in commutative rings in time 0{{mn — + n)2™) and space 0{n). 

All previous works we are aware of on evaluation of permanents assume 
commutativity, besides perhaps what is implicit in Ryser's formula, see ([T]) 
below. For commutative rings, our bounds improve upon the state-of-the- 
art achieved in a series of works based on arguably more involved techniques: 
Using the Binet-Minc formulas [12], Kawabata and Tarui [9] presented an 
algorithm that runs in time 0(n2'^ + 3™) and space 0(n2''"). Recently, Vas- 
silevska and Williams [11] took a different approach and obtained improved 



bounds 0{mn^2"^) and 0(n^2™), respectively. Finally, by a yet different, 
algebraic approach, Koutis and Williams [H] further improved these bounds 
to poly(m, ?T,)2™ and poly(m,ra). For commutative semirings, Vassilevska 
and Williams [14] gave a Gurevich-Shelah [B] type recursive partitioning 
algorithm running in time poly (m, 72)4"^ and space poly (m,n). Koutis and 
Williams [TT] presented bounds comparable to Theorem[T]^ii) using a dynamic 
programming algorithm similar to ours but in an algebraic guise. 

We begin without any further assumptions about the semiring and adopt 
the standard dynamic programming treatment of sequencing problems. That 
is, the algorithm tabulates intermediate results a{i, J) for sets J C N of size 
i, given by the recurrence 

a;(O,0) = 1, a{i,J) = '^^a{i — 1, J \ {j}) aij for i = 1, 2, . . . , m . 

Here J corresponds to the image (j({l,2, . . . ,i}) of the injection a, and it 
is easy to show that the permanent of A is obtained as the sum of the 
terms a{m, J) over all J C of size m. Straightforward analysis proves 
Theorem [l](i). 

In commutative semirings, we may transpose the previous algorithm, as 
follows. The idea is to go through the column indices j one by one, associating 
j with either one row index i not already associated with some other column, 
or associating j with none of the rows. Formally, for all / C M define 
recursively 

a(0,O) = 1 , a(/,0) = for / ^ , 

= Oi{IJ - 1) + ^a{I \{i},j - l)aij for j = 1, 2, . . . , n . 
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Here / corresponds to the preimage cr^^({l, 2, . . . ,j}) of the injection a. One 
can show easily by induction that a{I,j) equals the permanent of the sub- 
matrix of A consisting of the rows / and columns {1, 2, . . . , j}; in particular, 
a(M, n) = per A. To obtain the bounds in Theorem [l](ii), it remains to 
observe that a(/, j) needs to be computed only if \I\ < j < n — m + \I\, 
and thus, the time and space requirements are 0(m(n — m + 1)2"^) and 
0((?2 — m + 1)2'"), respectively. 

In rings, we start with Ryser's inclusion-exclusion formula. Denote by 
Qix the partial row sum of the entries aij with j G X. Ryser [13] found that 



(While Ryser's original derivation is for fields, it immediately extends to 
arbitrary rings.) Visiting the sets X, for instance, in the lexicographical 
order, the terms aix can be computed in an incremental fashion, each in 
constant amortized time. Thus the permanent can be evaluated in time 



efficient way to evaluate the permanent. 

But, when m is much less than n we can, in fact, do significantly better. 
For any subset of rows I (1 M and any subset of columns J C A^, let Ajj 
denote the corresponding submatrix of A. For simplicity, assume m is even, 
and denote i^' = {1, 2, . . . , m/2} and L = {m/2 + 1, m/2 + 2, . . . , m}. Now, 
we may write per A as the sum of the products perAxp per Alq over all 
disjoint pairs of subsets P,Q N with \P\ = \Q\ = m/2. While computing 

the sum over the (J/2) (";;72 ') such pairs (P, Q) may look inadvisable at first 

glance, the following observation changes the picture. 




XCN 
\X\<m 



(1) 




space 0(m). For square matrices this remains the most 
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For a set family 3", denote by I?" the family of sets in 5" and their subsets. 

Theorem 2 (Bjorklund et al. |4], Kennes |I10]). Let f and g be two func- 
tions from the subsets of a finite set U to a ring R. Then, 

S,TCU XCU SDX TDX 

SnT=0 

Furthermore, if 3^ and S are given families of subsets of U such that f and 
g vanish outside 5" and S, respectively, then the sum ^ can be computed 
with 0( \U\ ( \13^\ + ||S| )) ring and set operations, and with a storage for 
0(||3^| + I IS I) ring elements. 

To apply this result, we first note that the cardinality of 
1{P N : \P\ = m/2} is (1^/2)- Second, note that the permanent perAxp, 
for all P C of size m/2, can be computed in time o(^m(^^^^^^ and space 
^((j,m/2))' similarly for the permanents peiAiQ. Combining these bounds 
yields Theorem [Tj^iii). We also note without proof that the space requirement 
can be reduced to 0(m) at the cost of an extra factor of 3™/^ in the time 
requirement; the idea is the same as what we have recently used to count 
paths and packings [1]. 

Finally, in commutative rings we may transpose Ryser's formula in ana- 
logue to the transposed dynamic programming algorithm for commutative 
semirings. To this end, denote by axj the partial column sum of the entries 
aij with i G X. Then we may write 

per A = J2 (-1)"""" E <i ^X2 ■ ■ ■ a'xn , 

XCM p 

where the inner-most summation is over all binary sequences p = P1P2 ■ ■ - Pn £ 
{0, 1}" with pi + P2 + ■ ■ ■ + Pn = To see this, consider arbitrary row 
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indices ii, 12, . . . ,im ^ M and column indices ji, j2, ■ ■ ■ ,jm ^ N. Note that 
the expanded sum contains a unique term of the form cai^j^ai^j^ - ■ ■ ai^j^ 
if and only if the indices ji,j2, ■ ■ ■ ,jm are distinct; the coefficient c is the 
sum of the terms (— 1)™-"!"^! over all X C M that contain the row indices 
ii,i2, . . . ,im- If all the row indices are distinct, there is only one such set 
X, and the coefficient correctly equals (— = 1. Otherwise, there 
are equally many such subsets X of odd and even size, and the coefficient 
correctly vanishes. 

To analyze the time and space complexity, we note that, for any fixed 
X C M, the summation over the binary sequences p can be performed using 
simple dynamic programming in time 0{n + m{n — m)) and space 0(n)j^ 
Here we assume that the sets X are visited in a suitable order such that each 
partial column sum can be updated in an incremental fashion in constant 
amortized time. Theorem [Tj^iv) follows. 

We end by discussing the role of commutativity. With the given definition 
of permanents. Theorem [T] suggests that commutativity is crucial for efficient 
evaluation of permanents. However, we point out that with the following 
transposed definition, the bounds in Theorem [l](ii, iv) actually hold without 
the assumption of commutativity: For an injection a from M to A^, denote 
by (Tj the ith largest element in the image a{M). Define the transposed 

^In the field of complex numbers, where one can evaluate discrete convolution via fast 
Fourier transforms, the time requirement can be reduced to 0(nlog^ m). We are not aware 
whether such improvement is possible in an arbitrary (commutative) ring. 
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permanent of an m x n matrix A — {aij) over any semiring as 
per' ^ = ^ a(T-i(f^i)o-i aa-i((T2)o-2 • • • a(T-i(o-^)o-^ , 

a 

where the summation is over all injections a from {1,2, ...,m} to 
{1,2, Note that in any commutative semiring, of course, per' A = 

per A. 
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